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Chapter 1 


Introduction 


The Goldbach Conjecture is one of the most fascinating unsolved problems in 
number theory. It was (famously) first posed by Christian Goldbach in his 1742 
letter to Leonard Euler. His original question was whether all integers greater 
than 5 are representable as a sum of three primes. The problem immediately 
reduces to asking whether all even numbers greater than 2 are representable as 
a sum of two primes. 

Although it is still an open conjecture to show that all even numbers are 
expressible as a sum of two primes, the case for odd numbers is easier. It 
was shown in 1937 by I. M. Vinogradov that all sufficiently large odd integers 
are expressible as a sum of three primes. Vinogradov proved the three-primes 
theorem by analytical means, using a major arc/minor arc decomposition. The 
major arcs are treated by estimating the sum 


Hex) = Y x(n)A(n). 
naz 

The proof of this estimate relies on a large portion of classical theory of 
Dirichlet L-functions, and has as an immediate corollary the corresponding es- 
timate for n(x, a, q), the number of primes less than x equivalent to a mod q (a, 
q relatively prime). 

The minor arcs are treated by Vinogradov’s type I/type II sum decomposi- 
tion. Such a decomposition provides bounds on the sum 


5 A(n)e(na) 


which are sensitive to a. In this essay we work with the möbius function instead 
of the Von Mangoldt function, and apply a significant amount of harmonic 
analysis to achieve our results. 

To prove the three-primes theorem, we work with the sum 


r(n)= Y A(ki)A(k2)A(ks), 


k1i+k2+k3=n 


which counts representations of n as a sum of three prime powers, with 
a weight of log(pi) log(p2) log(p3) attached to each such representation. The 
contribution to this sum deriving from proper prime powers is small compared 
to the contribution from primes themselves (use partial summation). If there 
are no such representations, r(n) = 0. For the proof, we bound r(n) away from 
zero for odd n. We now state 


Theorem 1.0.1 (Vinogradov). For any fixed A > 0, 
1 N? 
x)= lero (E 


where 


se (gt) a) 


p|N piN 


If N is even, the above is not useful. Indeed, assume N is even. Then one 
of the k; is a power of two. If we consider the number of representations of N 
as a sum of a power of 2 and two integers, this gives a trivial upper bound of 
r(N) = O(N log*(N)). When N is even, 6(N) = 0 in the above, and the bound 
given by Vinogradov is considerably worse than the trivial one. 

For N odd, we have 


(1-53) =.6602..., and I+ ga) =2.301..., 


pA2 


and so 1.320... < G(N) < 2.301.... Hence, the N? term eventually domi- 
nates the second term and hence gives a lower bound for r(N). 

The remainder of the introductory chapter of this paper is devoted to the 
proof of Vinogradov's theorem modulo 3 key lemmas. The first should really 
not be called a lemma, but a theorem, for it is the bound on w(, X), equivalent 
to the prime number theorem for arithmetic progressions. For this estimate, 
we follow a hybrid of Green's notes [Gr1] and Davenport's classical book [Dav]. 
The second is a basic fact about Ramanujan sums. The proof of these two 
facts forms chapter 2. The third lemma is an estimate of the exponential sum 
Y u(nje(na) originally by Davenport, however we follow a proof of Green and 
Tao [GT], with some harmonic analysis from Montgomery [Mo]. The proof of 
this lemma makes up chapter 3. The precise statements of all of these can be 
found in the remainder of this chapter. 


1.1 Major and Minor Arcs 


We now proceed with the proof of the main theorem. To begin, consider the 
exponential sum, 


Then 


S(a)? = XO Alki) A(k2)A(ka)e((ka + ko + ks)a) 


k1,k2,k3<N 


= El Y) AYAIA(s) | elna) 
n k1+k2+k3=n 
ki, ko, ka N 


= 5 r(n, N)e(na) 
Note that r(n) = r(n, N) for n < N. Hence, we see that $(a)? is a Fourier 
series, and thus the coefficients r(N) can be recovered 


r(N) = S(a)e(—Na) da. (1.1) 
R/Z 

Bounding this integrand from below with therefore give us a bound on r(n) 
and Vinogradov's theorem. The integrand of (1.1) turns out to be large when a 
is near a rational number with small denominator, and small when far from one. 
Hence, we proceed by a major arc / minor arc decomposition which is common 
in such problems of additive number theory. 

We divide R/Z into subintervals M (major arcs) and m (minor arcs). The 
major arcs correspond to subintervals close to a rational number with small 
denominator, and the minor arcs their complement in R/Z. We make these 
notions precise as follows. Let P = log? (N), and Q = N/log??(N), where B 
will be chosen later in terms of A (see A from Vinogradov’s theorem, above). 
For all q < P, and all 1 < a < q with (a, q) = 1, define 


1 
Mía, q) = {a € R/Z such that |a — A so 


And then let M be the union of all the M(a, q). Let m be the complement 
of M in R/Z. 


Observation 1.1.1. m is non-empty. 


Proof: 
Any two of the M(a,q) are disjoint. Indeed, taking a/q 4 a'/q', we see for 
large enough N that 


1 


1 1 2 
ae aus cee 


a 


q q 
Hence M is not all of R/Z. 


/ 


We now carry out a separate analysis for each of M and m. 


1.2 Proof of Vinogradov’s Theorem 


We start with an individual M(a, q). Take the Gauss sum 


meZ/qZ 


where x is any character to modulus q. We then have that 
1 d 
wa Y” x(n)r(X) =0  if(n,q)>1 


oq) — 
XEZ/qZ 


because then x(n) = 0 for all n. If (n, q) = 1, then 


g Exodo = q Y Y ONES 


xEZ/qZ xeZ/q2 "EZ/9Z 


O 


a a) E 


where in the second line we have taken m = nh mod q. With reference to 
Mía, q), we now take a = a/q + 3. We then have 


ka 


Sa) = > A(k)e(— )e(kB) + O(log’ (N) 
k<N 
(k,q)=1 
fe 5a SCAK) Y x(ka)r(X)e(k8) + O(log?(N)) 
D ZN xeZ/az 
z aa) Y r(x)x(a) > x(k) A(k)e(k8) + O(log?(N)). 
x€Z/ ql k<N 


Taking the inner sum of the above and applying summation by parts we get 


N 
Y x(k)A(K)e(KB) = e(N BUN, x) — 208 | e(ubip(u,x) du. (1.2) 


k<N 


Where 


(a,x) = Y x(n)A(n) 


n<x 
as in the introduction. We analyze the trivial character xo separately. We now 
need our first 


Lemma 1.2.1. Let x 4 xo be any character to modulus q. If q < log Y (x) for 
some positive constant M, then we have 


[sb a, x)| = O(ze" COD vt) (1.3) 

for some positive constant C(M) which depends only on M. 
As noted above, this “lemma” is essentially equivalent to the prime number 
theorem for arithmetic progressions (and correspondingly makes up a significant 


portion of this paper). 
Combining the lemma with formula (1.2) we get 


XO x(k) A(k)e(k3) = O((1 + JBN) Ne ev 98%), 
k<N 


Now we estimate the contribution from the trivial character xo. Recall that 
(x) is defined 


d(x) = Y Aln) 


as in the prime number theorem. Because xo(n) = 1 for (n,q) = 1 and 
xXo(n) = 0 for (n, q) > 1, v(x, xo) resembles W(x), precisely, 


lls, xo) — ¥(a)| < Y An) = O(log(q) log(z)). 


(n,q)>1 


Recall that the prime number theorem gives 


p(x) =x + O(xe7eV"80)), 


the proof of which is simpler and follows the same form as our lemma 1.1. 
Hence, setting w(u, Xo) = lu] + R(u), we have 


R(u) = O(ue~°V 8), (1.4) 


Similar to the origin of (1.2), we set 


T(8)= Y e(kb), 


1<k<N 


and apply summation by parts to get 


N 
T(B) = e(NB) — 2118 f e(uB) |u| du 


Combine this with (1.2) and then apply (1.4) in the second line below to get 


N 
YE xolk)A(k)e(KB) = T(8) + e(NB)RW) -rig f e(ub)R(u) du 


k<N 


= T(6) + O((1 + |EN) Nev), 


Therefore, the sum on the left for the trivial character only differs from the sums 
of the other characters by the term T(8). We have hence reduced our sum to 


1 


S(a) = wD 


7(Xo0)(T(8) + O((L + IBIN) NE VIE) + Y r(X)x(a) - O((1 + [BIN Nev 8%?) 
X#FXO 
(1.5) 
Two quick facts about Gauss sums will reduce this expression further. 


Observation 1.2.2. T(Xo) = u(q) 
Proof: 


TE Y ume (=) 


= q 
(e) 
_ pees 
m<q q 
(m,q)=1 
2 md 
= Lug D (=) 
d=1 1<m'<q/d q 
= pla) 


Where we have set m'd = m. 


Observation 1.2.3. |7(x)| < q? 


Proof: 
Recall that if x is primitive, 


hence 


S hnftror= Y D rex) D e( M44) 


nEZ/qZ h1€Z/qZ h2€Z/qZ n€Z/qZ 
Therefore 


ADP =a Y xh? = asta). 


h€Z/qZ 


Hence 


1 

Tod] =q? 

if x is primitive. In the case of x imprimitive, one can show that T(x) < q? 
however, we omit this proof for sake of brevity. 


y) 


These two facts applied to (1.5) yield 


S(a) = MD 73) + O((14 IBIN)q? NeW eV 28), 


Now one applies the fact that our a lies in the major arc M(a,q). Using 
q < P, and |8| < 1/Q one obtains, 


Sla) = EDT) + o(werevH™), 
q 
thereby making significant improvement to the error term. We now cube 


and take the fourier series to find the contribution of M(a, q) to the integral 
(1.1) we originally set out to study; 


a)’e(—Na) da = Hq) —aN 1/Q DOE 2.0 RA 
faa ) ( à Me < plq)’ ( q fo e( NB) dB+O(N e ). 


Summing over all major arcs, we have 


q a 1/Q , 
a S(a)%e(—Na) da = 5 ae 5 e (=) Ta T(8\e(—N B) dB+O(N7e~° leet Ny 
asp pai a 6) 


The inside sum 


is known as Ramanujan’s sum. To evaluate the first factor in (1.6), we 
employ our second 


Lemma 1.2.4. Let c¿(n) be the Ramanujan sum defined above. 


1. The sum c¿(n) is multiplicative. 


2. If p% is the highest power of p dividing n, c¿(n) is given on prime powers 
by 


op") if B<a 
cp(n)=4-p®* if B=atl1 
0 ifB>atl 


The proof of which is actually rather short. Nonetheless, we confine it to 
Chapter 2. Now observe that all of the functions in sum on the first factor of 
(1.6) are multiplicative, and thus we may evaluate by euler product, 


c pg) A _ _@(N) 
2 gp E IIQ 1) 


- Ty) 0+ gap) 
= G(N). 


From the trivial estimate |c,(n)| < p(q), we find that 


Hence 


y ula) (Ny = G(N) + O(log"? + (N)). (1.7) 


Now we evaluate the factor in (1.6) involving the integral. We have 


1/Q 1 1-1/Q 
/ T(8)8e(—N8) dp = i T(8)%e(-NA) dB +0 dl Iro)? is) 
0 1/Q 


-1/Q 


We observe that T(3) is the sum of a geometric series by definition, and 
hence it is evaluated 


e((N + 1)8) — e(8) : 1 
T(P) = = O(min | N, 
e(8) -1 118ll,z 
where ||- ||r/z is the distance of - to the nearest integer. We thus have 


1/Q 1 N2 

JU TOPA-NB B= f TOPNO) = f rwrana aso (e) 

-1/Q 0 log“? (N) 
By analogous construction to the beginning of section 1.1, the integral on 

the right of this equation is equal to the number of ways of representing N as a 

sum of three arbitrary integers, N = kı + ka + kg, hence 


f T(G)%e(—NB) dB = E(N 1)(N — 2) = Ly? + O(N). 


Therefore, 
Er —Ngra6=N?+0 mE 1.8 
[SP o Tm) 8) 
Combining (1.6),(1.7) and (1.8), we get that 
l 2 m) 
L S(a e(—Na) da = ¿6(N)N o . (1.9) 


Vinogradov's theorem follows if we can show that the minor arcs contribute 
a smaller amount. We now proceed to their treatment. Take 


J S(ae(—Na) da 


A 
SS 
sE 
al 
= 
$ 
TL, 
z 
ER 
Q 


A 
E 
E 
22 
=; 
2 
z 
Q 


The integral evaluates as 


/ ¡Sada = Y Ak) Y Alta) | iaa 
0 < 0 


ki<N ko<N 
= SoA? = O(N log(N)). 
k<N 


We must now bound S(a) itself on m. We have 


1 log??(N 
a) _ log" (N) 


Q = 
> N 


for all q < P if a € m. Hence 
i a log?” (N) 
1.1 
RA a N ony 


We now evoke our third 


10 


Lemma 1.2.5. Leta € R, and C > 0. If for sufficiently large N, and any fixed 
k we have 


log28(0+4) ( y 
inf Idallayz > ge OE 
1<d<16 log®(C+4) (N) N 
then 
1 = 
lv SY uínje(na)| = O(log” °*!(N)) 
1<n<N 


which essentially says that œ € m causes u(n) to be asymptotically orthogo- 
nal to e(na). A statement about the móbius function is easily transformed into 
a statement about the von Mangoldt function. We have 


A(n) = Y log(d)u (3) l 
din 


Taking u(x) = 0 if x ¢ Z, we may write 


A(n) = Y log(d)q (5) l 


d<n 


Hence 


Our minor arc condition (1.10) implies 


log*? ( ) 
inf >. 
sega ay lien > ESS 


Taking B = 9(C + 4) satisfies the conditions of the lemma. We then may 
evaluate 


Y los(a) — A” 


S(a) < Rae 
5<N log? ~°(N/6) 
1 N 
< 5 
log (N) £y î 
log? ~"(N) 


11 


Combining this estimate with the above, we then have 


[ soya) da =O 5 (1.11) 


Upon taking B = 9(A + 6) and combining (1.9) and (1.11) we have proved 
Vinogradov’s theorem. A 

In the next chapter we treat lemmas (1.2.1) and (1.2.4), which are used to 
bound the major arcs above. The proof of lemma (1.2.1) makes up the bulk 
of the chapter. We mainly follow Davenport for chapter 2. In chapter 3 we 
prove lemma (1.2.5) used to bound the minor arcs. The treatment here is more 
contemporary, following a paper of Green and Tao on the quadratic uniformity 
of the móbius function. 


12 


Chapter 2 


Major Arcs 


2.1 Explicit Formula 


For the bulk of this chapter we work to derive good bounds on the sum 


ylz) = Y x(n) A(n). 
nír 
For each modulus q there are a collection of ¢(q) such sums, and the prime 
number theorem in arithmetic progressions quickly follows from knowing good 
bounds on these. In the Dirichlet convolution algebra we have A = log *p. 
Because Dirichlet characters x are completely multiplicative, we also have 


x(n)A(n) = X (x(d) log(d)) - (x (3) H (5)) 
din 


hence yA = xlog xx. Convolution in the Dirichlet algebra becomes multipli- 
cation of the corresponding Dirichlet series, 


s AMA) (>. ses) y XM) 
A A E YE 
_ Pls 
L(s,x) 


valid for Re(s) > 1. From the above, we see the connection between our 
sum Y(x, x) and Dirichlet L-functions. To obtain a partial sum from such a 
series, we work with the L-functions and employ a variant of Perron’s formula 
and obtain the formula 


ylz, x)=- Y de) — Y $a- 2m) +05 


[p[<T m=0 


eo (2.1) 


13 


Where the first sum is over all nontrivial zeros of L(s, x), ¢ is a smooth 
cutoff function, and db is its Mellin transform, which we define and discuss later. 
Note Dirichlet L-functions have slightly different forms for “odd” and “even” 
characters. The difference is realized in the above formula via our definition of 


ofo ifx(-1)=1 
1 if y(-1) =-1. 


Where in the first case the character is called “even” and the second it is 
called “odd”. Equation (2.1) is called the explicit formula for the Dirichlet 
L-function. It is particularly important because it directly shows the duality 
between prime numbers (i.e. the sum (a, x)) and non-trivial zeros of the L- 
function (i.e. the first sum on the right side of (2.1)). This connection is a 
guiding principle of analytic number theory. 

From this point forward, we derive a bound on 4(x,x) by bounding each 
term in the explicit formula. In the remainder of this section we show how the 
explicit formula is derived and set up an approximate formula where we restrict 
to non-trivial zeros with imaginary part less than T. In the next two sections 
(2.2) and (2.3), we study N(T, x) (that is, the number of zeros of L(s, x) in the 
critical strip with imaginary part less than T), and zero-free regions of L(s, x). 
These allow us the bound the first term in the above, and are essentially the 
most we can say about the location of the all-important critical zeros. In section 
(2.3) we encounter the complication of Siegel zeros - which may or may not exist 
- and spend the following section (2.4) bounding their contribution as best we 
can via Siegel’s theorem. Finally, in section (2.5) we evaluate all of our Mellin 
transforms, and obtain our final estimate for y(x, x). The brief proof of lemma 
(1.2.4) on Ramanujan Sums makes up section (2.6). 


2.1.1 The Mellin Transform 


We want to analyze the sum w(x, x). However, to apply analytic techniques, 
it is often more appropriate to reduce to a smoothed function. Therefore, we 
take the function ¢(n) to be any C function on R, compactly supported on a 
subset of (1,00). Instead of y(x, x) we work with 


vale, x) = Y x(n)A(n)d(n). 
We prove the explicit formula for 4V¿(x,x), and then reduce to (2.1) later. 


The proof of the explicit formula works extensively with Mellin transforms, 
which for s € C we define 


is) = [Par E, 


the Mellin transform of ¢. Because ¢ has compact support, db is analytic. 
The Mellin transform is analogous to the Fourier transform, except with respect 


14 


to the group Rt with multiplication. We have that x* is a character for this 
group, and that dz/x is an invariant measure. In like fashion to Fourier, we 
have the 


Proposition 2.1.1 (Mellin Inversion Theorem). Let ¢: R — R be any CS 
function with compact support contained in (1,00). Then for o € R we have 


aot+ioo 7 
(a) = 5 / ¿(s)a7* ds. 


Ont A 
Proof: 
Follows from the Fourier inversion theorem and a change of variables. 


We know that in fourier analysis, smoothness of a function is manifested in 
the decay of its fourier transform. Similar properties hold with Mellin trans- 
forms. In fact, if @ were analytic, then $ would have exponential decay in vertical 
strips (i.e. in the tioo direction), see Titchmarsh [Tit]. However, analytic func- 
tions which approximate the characteristic function of an interval do not exist. 
We must instead use the following lemma which applies to C% functions and 
gives a slightly weaker result. 


Lemma 2.1.2 (Mellin Decay). Suppose that 6 : R — R is a CS function 
supported in the interval [X,2X] for some X > 1. If oi < 02 are real numbers, 
we have the estimate 


m 
($o + it)| < Cm(or, 02) X X XID Alle 
j=0 
uniformly for o € |01, 02]. For fixed m, Cm(01, 02) grows at most polynomi- 
ally in |oi| + 02). 


The lemma more or less states that db decays faster than any polynomial 
in vertical strips, where the constants depend on the order of polynomial, the 
support of ¢ and the size of its derivatives. 


Proof: 
Follows from the corresponding lemma about Fourier transforms. 


2.1.2 Perron’s Formula and application to A(n) 


We now apply Mellin inversion and the regularity results above to derive a 
smoothed version of Perron’s formula. 


Proposition 2.1.3 (Perron’s Formula with smooth cutoff). If (bn) is an arith- 
metic sequence with Dirichlet series f(s) convergent for Re(s) > so, and @: 
R=R is CS and compactly supported on some subset of (1,00), then 


co a+ioo 
am) tas 


2ri Joio 


15 


fora > sy. 


Proof: 
We have 
co 1 co a+io _ 
bn = EN bn z 
Lra = 3D fas 
1 ao+too oo b A 
TLO 


Where the interchange of the summation and integration is justified wherever 
o > 00. 


Applying the formula to w(x, x) we have 


1 pe ea 
ENEA 2 d. 
2mt a—ioco L(s, x) As) i 


Y x(m)A(m)o(n) = 


To get the explicit formula for p(x, x) we evaluate the above by contour 
integration. 


Proposition 2.1.4 (Explicit Formula with smooth cutoff ¢). If ¢ is a CS 
function compactly supported on a subset of (1,00), we have 


bala, x) =- Y d(p) — y ¿(a — 25) 


where a equals 0 or 1 depending on whether x is even or odd, and the first 
sum is over all of the non-trivial zeros p of L(s, Xx) counted with multiplicity. 


Proof: 

We would like to prove the formula for ¢ compactly supported on some 
subset of (1,00). However, it suffices to consider ¢ supported on [x, 2x] for x > 1 
because any other ¢ supported on a different interval may be reconstructed via 
a smooth partition of unity as a finite sum of intervals of this type. Therefore, 
we will be able to use lemma (2.1.2). 

To prove the proposition, we will use a standard contour integration tech- 
nique, integrating around a closed square to the left of the line Re(s) = o and 
then expanding the contour to —oo to pick up the residues of all the poles 
of L'(s,x) - p(s)/L(s, x). Because L'(s, x) and ¢ have no poles, we are only 
concerned with the zeros of L(s,x). All trivial poles have 


de 
Beg, jen ng 


L(s, x) 
and all poles p in the critical strip have residue equal to their multiplicity. 
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We now choose a contour C = C1 U C2 U C3 U C4, with 


Cı = [oo iT, oo +11] 
Ca = [oo tiT,-R+ iT] 
C3 = [-R+iT,-R-iT] 
C4 [(—R — iT, co — iT] 


With R and T later tending to infinity. In picking such a contour, we take 
care to pick R and T such that C3 avoids any of the trivial poles of L’/L (i.e. 
choose R ¢ Z), and that C2 and C4 avoid any of the critical poles. If we define 


1 E $ 
a / 50 (s) ds, 


2m 


The reside theorem states 


L+E+1+1=- Y 60) - Y pla- 2m). (2.2) 


If we let T —= oo, 


Iı — palo, x) = En 


so to complete our proof, we must bound the integrals J4, I2, and /3. Lemma 
2.1.2 provides a bound for ¢. We have the following lemmas which bound L’/L. 


Lemma 2.1.5. If -l<o< 2, 


L'(o + iT, x) 


Moi 7 les (a) 


Proof: 

This proof depends on material from section (2.2.1), however our arguments 
are not circular. Let p = @ + iy be a nontrivial zero of L(s,x). Observation 
(2.2.3) gives that the number of zeros with |y — T| < 1 is O(log(q(|T| + 2))). 
There are then O(log(q(|T'| + 2))) zeros with imaginary parts within an interval 
of length 2. Hence there must be a gap of length > clog(q(|T| + 2)) amongst 
them. Therefore, by varying T by a bounded amount, we have that 


Oe 
log(qT) 


for all zeros. Formula (2.13) from section (2.2.1), states that for s with 
—1 < Re(s) < 2 one has 


pas 
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1 
= Y, = + Ollog(a(\t| + 2))). 
L(s,x) Gs? 
ly-t|<1 
From the above choice of T, we have that each term of the sum is O(log(qT)), 


and from observation (2.2.3) that there are O(log(qt)) terms in the sum. Hence, 
the lemma follows. 


Lemma 2.1.6. For s € C with Re(s) < —1 and excluding a circle around each 
trivial zero of L, we have 


D's, x) _ 


Proof: 


We need the functional equation for Dirichlet L-functions, which we will 
assume without proof, 


T 


o (Dirt tone = 6 


AR 1 1 
2 2 T 


¿5430 +1))L(1—s, x). 


Employing some standard functional equation relations from complex anal- 
ysis (proved by analyzing Hadamard products), 


1 Soyot 
r(s)T(1-— s) = B T(s)T(s + 3) Dd) 
we obtain 
D(3s) ti T 
22 = 1 2218 cos(_s)T(s) 
G- 35) 2 


Applying this to the functional equation, we obtain an asymmetric form 


ONE 
T(X) 


Taking the logarithmic derivative, 


L(l—s,x) = DE 8 93 cos( 5(s —a))P(s)L(s,¥). 


L'(1— 8, x) 


L(1— s,x) = — log(2)—log(m)+log(q) 5 sin(5(s a)) log (cos(F(s a))) 


If we avoid the poles produced by the log(cos(-)), we see that this term is 
bounded. Furthermore we have that the I’’/T term is O(log(|s|)), and that the 
L'/L term is bounded when Re(s) > 2 (see later lemma (2.1.7), we obtain the 
estimate in the lemma. 
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With the bounds given in lemmas (2.1.2), (2.1.5) and (2.1.6) we can bound 
In, Iz and I4. We begin with Ig and I4. With reference to lemma (2.1.2) 
(Mellin Decay), we will later choose ¢ to resemble the characteristic function 
of an interval. Therefore, we will have ||é||z1 = O(x), ||D@||z1 = O(1) and 
||D?4lli1 = O(a). 

For I3 we use lemma (2.1.2) (Mellin Decay) with m = 2, and lemma (2.1.6). 
We have 


eae Head 
——~ | - |d(s)| ds 
Tey | llo) 


T 
= Í O(log(q| — R + itl)) -Crlt|-Pa-** lelle + all Dollz: + 2?11D%]|11) dt 
-T 


|I| 


IA 


27 J rr 


T log(qt) 
t2 
T 


= Os(Cra E? los(a)) f dt 


= Op(Cra "t? log(q)). 


Lemma (2.1.2) (Mellin Decay) gives that Cp is at most a polynomial in R. 
Because we have that x > 1, the integral over C3 goes to 0 as R —> ov. 

We now address I> and 4. We use all three of our lemmas, but take m = 1 
in lemma (2.1.2) (Mellin Decay) instead. 


1 ootiT Ls x) M 
hb|=|L4| < = AL. d 
m=i < pf RER olas 
i ANE iT, 3 1 f° |L'(o +iT, z E 
+3 / eat x) [plo + iT)| do 4 / E _ x) [plo + iT)| do 
tJ_p | L(o +iT,x) 2n Jı | Llo +1T,x) 


—1 oo 
/ +0(log?(4T)) / CL Ta (Vola + all Dolla) do 


7 ER 

7 A ma ( T ) P a 

= Otlog(aR)) | |d(o + iT)| do + Og (= a log w) 

= Og(log(qR)) Š Cı(-R, —1)|T| 1x do + Og (se ean) 


=: 04 (cr) fe RA, (test foe") 


T _R T 
log(q) log(aJaRO; 270-1 log(z) log?(qT) 
= Os ( T R) 4 Os T p 


Where again C' is at most polynomial in R, and hence is controlled by go, 


Both of these terms are seen to go to 0 as T — oo. Drawing together equation 
(2.2) and these last two estimates, we have 
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R/2 
11. ]=- Y 00) — YO ó(a— 2m) + O6(Cre~**" log(a)) 


lelST m=0 


+05 (eo wien +05 A eee?) . 


where the first O term is as R — oo and the second two are as T — œœ. 
Letting R go to infinity gives 


YA a Y dp) = 5 ¿a 2m) Os (= est) log J ; (2.3) 


[p[<T m=0 
Finally, we have that 


Lemma 2.1.7. If oco > 1, ¢ as above, 


1 


Proof: 


L fr LLC 3 
woe) nisi f" FEN]. bolas 


For 07 > 1, we have that |L’ (oo + it)| < ¢’(o0) = O(1). In the same range 
we have the absolutely convergent Euler product 


otiT 


|log(L(s,x))| = 


1 
vee] 1—x(p)p-* 
> | log(1 — x(p)p7*)| 
e 3 2 
aa 
2E O(1). 


P 


IA 


ps) 


=on 


IA 


IA 


So log(L(s, x)) remains bounded independent of t, hence 1/L(s, x) also remains 
bounded as t —> +00. We know from lemma (2.1.2) (Mellin Decay) that ¢ has 
polynomial decay in vertical strips, hence the result. À 
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Combining lemma (2.1.7) with equation (2.3), we have 


pale) = Mhl+Osal/ 18T) == Y) Ho) D> H(a-2m) +04 (Roe a) 
[p[<T m=0 
(2.4) 


and letting T — co yields the proposition. 


Now that we have the explicit formula for a smooth cutoff function ¢ in 
terms of its Mellin transform, we must bound its terms to achieve the lemma 


[b(a, x)| = Owe COM VR) 


which we originally set out to prove. We need some significant results about the 
nontrivial zeros of L(s, x) before we pick f and evaluate the entire expression 
in section (2.5). 


2.2 N(T,x) 


In this section we establish an asymptotic estimate for the number of zeros 
N(T, x) of L(s, x) with imaginary part |t| < T in the critical strip. Throughout 
we assume that x is not the trivial character unless otherwise noted. As one 
might expect, we apply the argument principle from complex analysis. We pick 
the contour R= R¡ U R2 U R3 U R3 to be 


9 o. 
Ri = E Zar 
5, 3, 
Roa = [i +ir -3 +a] 
3 ; 
Rg = Sa tiha T 
r = [m3 


Rather than work with L(s, x) itself, it is more appropriate to work with the 
variant 


rGs+ 5a)L(s,x) (2.5) 


because of the simplified functional equation 


(s, x) = e 


T 


ae 
iq? 


roge) 


¿a 7i s,X) = 


21 


J 


Recall that T(x) is the Gauss sum. Recall the lemma of chapter 1 gives 
\r(x)| = q2 for primitive x, hence the multiplying factor in the above equation 
has absolute value 1. In the definition of €(s, X), the poles of I cancel with the 
trivial zeros of L(s, x), hence, the only possible zeros are those in the critical 
strip. We have 


E) 
2i R Els, xX) i 


Because of the symmetry in the functional equation for £(s, X), 


NCE, x) = 


s = = (Imlog(é(s,x)))n- (26) 


Im log(€(o + it, x)) = Imlog(€(1 — o — it, X)) + c, 


we have that the left hand side and right hand side of R have the same 
contribution to (2.6). We evaluate each term of £(s, x) over the half-contour 
H = [1/2 — iT, 3/2 — iT] U [3/2 — iT, 3/2 + iT] U [3/2 + iT, 1/2 + iT]. We have 


(tutos (3) ant = T log (2) 


for the first term. Note our notation for evaluation along H. For the next 
term of £(s, x), we evoke Stirling’s formula for the gamma function 


Lemma 2.2.1 (Stirling’s Approximation). If Re(s) > 0, 
log(T(s)) = slog(s) — s + O(log(s)). 


Proof: 
See Whittaker and Watson, Chapter 12 [WW]. 


This gives us 


1 1 T 
ImlogP | =s + =a] =Tlog| — ) —T+ O(log(T)). 
a Ae 2 


It remains then to evaluate Imlog(L(s, x)) along H. For this, we will need 
some basic results about L(s, x). 
2.2.1 Hadamard Product for £(s, x) 


We know from undergraduate complex analysis that entire functions are deter- 
mined (up to e/)) by their zeros. This notion is made precise via the Hadamard 
product representation of a function. Recall that the order of growth po of an 
entire function g is defined to be the infimum over all p such that 


Ig(2)| < AeP I, 


Then we have 
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Lemma 2.2.2 (Hadamard Product). Suppose that f is an entire function and 
has growth order po. Let k bet the integer so that k < po <k+1. Ifaj,az,... 
denote the (non-zero) zeros of f, then 


Ez P(z),m E 2 
flay = P=" IT (E), 


where P is a polynomial of degree < k, m is the order of the zero of f at z =Q, 
and Ex(z) are the canonical factors, defined 


Exlz) =(1- ¿je +0 /2+=+z"/k 


Proof: 
See Stein, Complex Analysis, 2003 [St1]. 


We can apply this to £(s, x) because it has no poles or other singularities, 
and is hence an entire function. First, we compute its growth order, so we need 
estimates on each part of factor of £(s, x). By summation by parts, we have 
that 


L(s, x) =s | pt dx, where S(x) = 5 x(n), 


n<x 


valid for Re(s) > 0. If x is not the trivial character, we have |S(x)] < q, and 


|L(s, X)| <2q|s| for Re(s) > 1/2. 


Combining the above and estimates on I from Stirling's formula (lemma 
(2.2.1)), we have 


1 1 
leli] < ROA SIPS 8 + 50) 


< g2kels)+3 ¿Cls| los |s| 

when Re(s) > 1/2. The functional equation gives an identical bound when 
instead Re(s) < 1/2. Therefore we have that the order of growth of £(s, x) is 1. 
We have then by lemma (2.2.2) that 


£(s, x) = e*+8s][ (: 2 >) es, (2.7) 


p 


where A and B depend on y. We now use this and (2.5) to compute a 
formula for L'/L. Taking the logarithmic derivatives of (2.5) and (2.7), we have 


Ds) _ 5 lox 1 1T (3s + 34) BOJA L(+ *). (2.8) 


2 T(3s+ ża) 
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2.2.2 Several Estimates for L/'/L 


We take the negative real part of (2.8), and apply Stirling's approximation again 
(2.2.1), we have 


(ZER) = pied ge (REHE) reos re (+ 


<  cllog(q) + log(t + 2)) — Re(B(x)) — Re 2 ( DEP ‘)) 


a \s-P P 


where the approximation of T is valid in the region t > 2 and 1 < ø < 2. 
We can further simplify this if we analyze the constant B(x). We have from 
logarithmic differentiation of (2.7) and the functional equation, 


Because B(X) = B(x), we have that 


2Re(B(x)) = X` (Re (5) + Re 9) i 


P 


By the functional equation again, we may write 


Re(B(x)) = Y C + =) =— 5 Re (=) : (2.9) 


We use (2.9) to remove B(x) from our estimate of E'/L above, obtaining 


ae (Et 


1 
ew ) < c(log(q) + log(t + 2)) — ae ( ) : (2.10) 


sp 
This form will be quite useful in the next section when we find zero-free 
regions of L(s, x) for real x. It is also useful for the remainder of the present 


section. By the proof of lemma (2.1.7), we know that |L’/L| is bounded for 
Re(s) > 1, thus we have that 


1 
2 Re (=) < c(log(q) + log(t + 2)). 


Taking p = p + iy and s = 2 + iT say, we compute 
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E 1 ) 2-8 da 1 
e = E > 
Spy - AaB A 4+(T-7)? 
and hence find that 
1 
5 = Ollo t| + 2))). 2.11 
isa (log(a(lt| + 2))) (2.11) 
The above formula (2.11) has two important consequences. 


Observation 2.2.3. The number of zeros withT—-1l<y<T+1 is 
O(log(q([t] + 2))). 


Observation 2.2.4. The sum 


Y. = Ollog(a( lt] +2))). 


p=B+iy Cara) 
yv¢(T-1,T+1) 


We now restrict to s with —1 < Re(s) < 2. Using the partial fraction 
expansion (2.8) at both s and 2 + it, and subtracting the two yields 


L’(s, x) = 1 1 
OEN =00+5 ( sa): (2.12) 


sp 


In the spirit of the above two observations, we split this into the regions 
where the zeros p have either |y — t| > 1 or |y — t| < 1. For the first of these 
two cases we have 


1 1 2-0 3 
2 =- Y eoe aE Le paige Oat) 


0 G a 
p=P+iy P p=B+iy p=B+iy 
ly-t|21 ly-t|21 ly-t|21 


by observation (2.2.4). Observe that for terms with |y — t| < 1, we have |2 + 
it — p| > 1, thus we have reduced our estimate (2.12) 


II 


1 1 
Tay = ogedi E (3772) 


p=b+iy 
ly-t|<1 


Oloslal+2)+ Y) + Y 


p=P+iy p=P+iy 
ly—t|<1 Iy—t|<1 


IA 


II 


O(log(a(lt| +2) + SO aes (2.13) 
ae 
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where the last line follows from observation (2.2.3). 

By this point the reader may have lost track of what we were originally 
trying to prove. Referring back to the beginning of the section, we were looking 
for an appropriate bound on Imlog(L(s, x)) in the application of the argument 
principle for finding N(T, x). We now have such an appropriate approximation 
in (2.13). Thus, 


(Im log(L(s, x))) y = ow- f Im (2) af Im Ge 2) ds, 


HT L(s,x) 4-iT 


where the O(1) comes from the integral along [2 — iT,2 + iT]. Then 


(Im log(L(s, x)))y = O(log(a(|t1+2)))+ Y ([ (=) as f m 2 ) is) 


? ro s— 
p=6+iy +iT 3 MT Pp 
Iy—t|<1 


By application of the argument principle again to this simpler function, we have 
that both integrals are bounded by m. By observation (2.2.3), the number of 
terms in the sum is O(log(q(|t| + 2))), and hence we finally have 


(Imlog(L(s, x))) 4 = Ollog(q(1t] + 2))). (2.14) 


Hence the contribution to N(T, x) from L(s, x) is small compared to that of 
ES and I. Drawing these results together and recalling that H is only 


half of the full contour R, we have that 


N(T,x) = T log (Z) L + O(log(qT)). (2.15) 


2.3 Zero-Free Regions for L(s, x) 


The result of the above section is one important estimate which allows us to 
bound the side of Hp) in the explicit formula. However, we also need a bound 
on how close zeros may be to the line Re(s) = 1 to estimate ¿(p). In this section, 
the distinction between real and complex characters becomes significant, and we 
will see that the possibility of a zero on the real line close to s = 1 (a Siegel 
zero) cannot be ruled out, despite being able to otherwise establish a zero-free 
region for L(s, x) of width O(1/log(t)) elsewhere. Throughout we assume that 


x is not the trivial character unless otherwise noted. 


2.3.1 Regions for Complex x 
We begin with the trivial but useful inequality 
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o 
IA 


2(1 + cos(0))? 
3 + 4cos(0) + cos(20). 


II 


Next, if s = 0 + it and o > 1, compute from the Dirichlet series 


-Re (52) = so A(n) Re(x(m)je #28), 


L(s,x) ne 


We have that for (n,q) = 1, the modulus |x(n)e~“!°8™| = 1, hence 


n=1 


x(n)e~#loa(n) = e 


for some 0, and 
Re(x(n)e~# 128) = cos(0). 


Squaring or multiplying by e7* the first of these two likewise yields 
Relé (n)e 2" 98()) = cos(20) 


and 


Re(xo) = cos(0) = 1. 


Hence, by our trivial trigonometric inequality above, 


3 (9x0) _ yp, (Ple + tt.) L'(o + 2it, x’) 
L(o, Xo) L(o + it, x) L(o + 2it, x?) 


We have x? = xo if and only if x is a real character. This creates difficulties 
for us later and leads to the possibility of a Siegel zero. First, suppose that x is 
complex and primitive. The first term in (2.16) resembles the zeta function, 


) >0. (2.16) 


_L(0,xo) _ E x(n) 2 
L(o, xo) ne = C(c) ai” OM) 


n=1 


close to, but on the right of ø = 1. For the next two terms in (2.16), we appeal 
to the estimate (2.10) of the previous section, which states 


-Re (x) < c(log(q(t + 2)) — Y Re ( : ) (2.17) 


sp 


The terms in the sum are 


1 = 
Re ( ie g an 
sp) |s=ol 


therefore are able to omit some or all of the terms in the sum and the estimate 
will remain valid. Even though we assumed that x was primitive above, x? may 
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not be, although this does not affect the argument as we will show with a quick 
estimate. If xı induces x?, then 


L(s,x?) = L(s, x1) ] [0 — xi (P), (2.18) 
pla 


and hence 


L'(s,x?) _ L'(s, x1) Sy beste” 
L(s,x?)  E(s,x1) | ~ oe 
< S$ log(p) < log(q). 
pla 


This is within the bound in (2.17), so we need not concern ourselves with 
the possibility of x? being imprimitive. Given our above remarks about the 
positivity of the terms in (2.17), for x? we choose to omit the entire sum, hence 


L'(o + 2it, x?) 
L(o + 2it, x?) 


) = ottosta(t +2) 


Lastly, we bound the middle term. Given some zero p = PB + iy, we take 
t =. In the sum in (2.17) we keep only the term corresponding to this zero, 


L'(o + it, x) 1 
R 7 l t+2)). 
“(Foes pS pee )) 
Combining the estimates for all three terms of (2.16) we see that for the zero 
p we picked above, 


4 3 
< —— + c log(q(t+ 2)). 
o-=B c-l 
We have chosen t but are still free to choose o, and now take o = 1 + 
c3/(log(q(t + 2)) for an appropriately chosen c3. Applying this to the above 
equation gives 


C3 4c3 <1 C5 
log(g(t+2))  3log(q(t + 2))) + caca log(q(t + 2)) 


for appropriately chosen c;. We have this for any complex primitive charac- 
ter. But if x is imprimitive, the only extra zeros arise from euler factors as in 
(2.18), and accordingly have 8 = 0. Thus we have shown that 


B<14 


Observation 2.3.1. There exists a positive absolute constant cs such that, if 
x is a complex character mod q, any zero B + iy of L(s, x) satisfies 


C5 


P< Teal +2): 


(2.19) 
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2.3.2 Regions for Real x 


Now we proceed to real characters, which, as is common in the theory of Dirichlet 
L-functions, are much more difficult. Our estimates for L(ø, Xo) and L(o+tt, x) 
from the previous section still hold. However, we now have that x? = Xo so 
(2.17) is no longer valid. Because L(s, Xo) differs from ¢(s) only by a number 
of euler factors which divide q, we have again that 


ae C(s) 


O 


when Re(s) > 1. We have a similar formula to (2.17) for the zeta function (we 
omit its proof, but it is along the same lines) 


—Re (>) < Re 


log(t + 2 
aq tS og(t + 2) 


Thus 


Re Ge 28 a) ¿Be (==) Hera) 


L(o + 2it, x?) og —1+ 2it 


Since this is about as good as we can hope for, we use it along with our 
estimates of the other terms in (2.16) and obtain 


4 3 1 
$ < zai + Re (<a) + c7(log(q(t + 2))) 


where we take t = y and ø = 1+ 6/(log(q(t+2))) as in the case for complex 
X- 

However, we will now need an extra supposition to make our calculations 
work, and we postulate in addition that y > 6/(log(q(t + 2))). We obtain 


== < log(q(|7| + 2)) G l 5 | cr) 


ô 4 — 5c7ô 
A aman) fol 


where our postulate was necessary to obtain the second term in the first equa- 
tion. In order for the zero-free region we have just constructed to be inside 
the critical strip, we must have that the second parentheses is a strictly posi- 
tive quantity. We can obtain this if let 6 be sufficiently small compared to cz, 
say less than cg (which depends of course on previous constants). Note that if 
we had not assumed the extra postulate, we would not have had the 4 in this 
equation, and the second parentheses would have been strictly negative, giving 
a zero free region outside the critical strip, which is clearly useless. Therefore, 
our results are subject to the condition y > 6/log(q(|y| + 2)), or the slightly 
stronger condition y > 9/log(q). We have thus proved 


and 
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Observation 2.3.2. There exists a positive absolute constant cg such that, if 
0 <6 <cg, and x a real nontrivial character mod q, then any zero p = PB + iy 
of L(s,x) for which 


| | > ae 
= tog (q) 


satisfies 


ô 


ETTE 


We have not addressed imprimitive characters, however they can be dealt 
with by the same process as in the case of complex characters. 


2.3.3 Siegel Zeros 


It remains to address the region of the critical strip |Im(s)| < 6/log(q), where 
additional zeros may lie. We will show that all but one of these zeros are far 
from s = 1, in the sense that it has 8 < 1 — e/log(q) for suitable positive e. 
Then there exists at most one zero with 6 > 1 — e/log(q), hence it must be 
real (because otherwise we would also have it’s conjugate). This possible zero 
is called a Siegel zero. 

We assume there are two zeros with |Im(p)| < 6/log(q), and show that they 
must be away from s = 1. We assume that these zeros are complex and come in 
a pair 6 + iy. If one assumes there are two real zeros, the proof is similar and 
we omit it. Begin by returning to our favourite estimate (2.10) or (2.17), 


L'(0,x) 1 
== < cg log(q) — y 
Lo > (9) D 


valid for s = ø > 1. Recall that the terms in the sum are positive, so that 
we can drop arbitrarily many of them and retain a true statement. We drop 
all terms in the sum except for the two zeros whose existence we have assumed 
above. We now bound either side of this, obtaining 


Clo) -_Ll0,x) 1 2(0 — b) 
ETA Gay o Y AO gag < oed BR FF 
(2.20) 
We now take o = 1 + 2e/log(q), giving 
1 1 € 
50-8) > (0-1) = ¿> 


Combining the last two equations and simplifying, we obtain 


1 8 
pa? < C11 log(q) el 5(0— B) 
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from which it follows 


6 — 20c11€ 
5log(q)(1 + 2c11€) 


Hence if e is chosen sufficiently small, we have for a new e’ positive 
y ? p ti 


8x1 


1 


€ 
teeta). 


Therefore we have 


Observation 2.3.3. There exists a positive absolute constant cio such that, 
if 0 < e < Cig, the only possible zero of L(s,x) for a real, nontrivial x, and 
satisfying both 


hl < —, pioa 
log(q) log(q) 


is a single simple real zero. Such a zero is called a Siegel zero. 


and B>1- 


2.4 Siegel’s Theorem 


We have only one more zero to locate. All other zeros were bounded appro- 
priately far away from the line Re(s) = 1 in observations (2.3.1) and (2.3.2), 
however, we still need to address the possible Siegel zero. An appropriate bound 
of this zero away from s = 1 is given by Siegel’s theorem. 


Proposition 2.4.1 (Siegel’s Theorem). For any e > 0, there exists a positive 
number c(e) such that, if x is a real primitive character mod q, then 


cle) 


L(1,x) > E 


The result about the location of Siegel zeros which we are interested in and 
want to use follows from this as a corollary. 


Corollary 2.4.2. For any e > 0, there exists a positive number c' (e) such that, 
if x is a real primitive nontrivial character mod q, then L(s,x) 40 for 


/ 
s>1- a 
We prove the corollary first, and then proceed to the proof of the original 
theorem second. 
Proof (Corollary): 
Suppose we have a zero 8 which is close to s = 1 and hence contradicts our 
corollary. We want to apply the mean value theorem, so we have 
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for some c € [8,1]. We have that 


ccf elie! 


for large enough q. We now derive a bound for L’ in this slightly larger interval 
by a standard main term and tail argument. We have 


fene > SIA) 


for o > 0. We take o real and lying in the slightly larger region above. Split 
the sum at n = q. We first take the main term. We have that n < q and 
1—o < 1/log(q), hence 


which implies 


e 
no = e77 log(n) < el —log(n) = 


3 


Hence, we have for the main term that 


1 log(n)x(n 
D gln)x(n)| < 


ne 


n=1 


For the tail we have n > q and apply partial summation, 


S log(n)x(n) 6 ~ A 
DY |< loga) max] $O x(m)| = Ologla) 
n=q+1 m=q+1 


1 


Hence we have that for o € [1 — 9, 1), 


L' (ø, x) = O(log*(q)). 
We use these two facts to evaluate the right side of (2.21), giving 
¢ (e) log*(q) 
ce = 


This is a contradiction with Siegel’s theorem upon picking e correctly. Hence 
such a zero cannot exist. 


L(1,x) = (1 — B)L'(e,x) < 
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2.4.1 Goldfeld’s Proof 


We now proceed directly to the proof of Siegel’s theorem. We follow Goldfeld’s 
1974 proof [Gol] which employs the Dedekind zeta function of a biquadratic 
field. 


Proof (Siegel's Theorem): 

Let K be a biquadratic extension of Q. Let f(s) = Cx(s) = C(s)L(s, x1) L(s, x2) L(s, x1Xx2)- 
It has a pole at s = 1 due to the Riemann zeta function with residue A = 
L(1, x1) £0, x2) £1, x1x2). We pull out a quick 


Lemma 2.4.3. For every e > 0 there exists some x1 mod qı and a B satisfying 
l1—e< {<1 such that f(G) <0 independent of what X2 mod q2 is. 


Proof of lemma: 
We have two cases. First suppose that there are no zeros in [1—e, 1] for any 
L(s,x). Then L(s, x1), L(s, x2) and L(s, x1x2) are all positive in this interval 
but ¢(s) is negative, so f(8) < 0. In the other case, such a real zero does exist 
for some L(s, x1). Let 8 be such a zero of this L. Then f(8) = 0 regardless of 
what xa is. . 


Now we apply Perron’s formula, picking 


na. 
oa A) if n € [1,2] 
= 0 if n € [x,00). 


Note that ¢ is a C* cutoff function instead of C. This changes our Mellin 
decay properties lemma, however, does not affect the outcome of our argument, 
so we do not go into detail. Perron’s formula gives 


2+100 s 


1 zx 
da AP sett Det Det” 
oe _ £B) | ((am)'**a# 
-Ae Ra sl iB ey 


Our integrand has poles at 1 — 5, 0,—1,—2,-—3, and —4, and the first two 
terms in the second line represent the residues of these poles. By the lemma, 
we have that f(6) < 0, and we are free to pick x, so we choose one with 
(q192)?+* = O(x). We now need a crude bound on L(1, x). Recall Dirichlet’s 
class number formula for an imaginary quadratic field, 


h(=d) = p-wV (1, xa), 


where we have that w = 2,4,or 6, and xq is the real character given by the 
Kronecker symbol mod d. Because we know that the class number is always 
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positive, i.e. h(d) > 1, the formula gives that L(1, x) = O(d7*/?). These three 
facts applied to the above yield 


1<A 


"aad 
1-8 
where the bound on the Z-function given by the class number formula was used 
to estimate that A > 1/q1q2. That is to say, the error term in (2.22) is controlled 
under the conditions we have set forth. 

Using a main term and tail argument similar to (but simpler than) that 
in the proof of corollary (2.4.1), we also have for any nontrivial x mod q that 
L(1, x) = O(log(q)). Hence we have also that 


A < L(1, x2) log (91) log (9192). 
Pulling our last two equations together, we have 


clxa, qi, e) 
gO) Jog (qa) 


In actuality, x1 and qı only depend on e and hence the constant above only 
depends on e. If 0 < € < 1, then (2 + e)(1 — 6) < 3e, and 


cl) 
q% log(q) | 
The theorem then follows from this for sufficiently large q and upon relabeling 
é. 


L(1, x2) > 


L(1,x) > 


Although Siegel’s theorem does give us a nice bound on the size of the L- 
function at 1, the constants implied are unfortunately ineffective. That is, any 
proof of Siegel’s theorem to date (including the one above) does not give a way 
of assigning a numerical value to the constant c(<) for a given value of e. 


2.5 Final estimate for y(x, x) 


We how return to the ideas of section (2.1), having collected all of the estimates 
on zeros of L functions we need. Recall the explicit formula from section (2.1), 
x ee x log(x) log? (qT 
Examen) =- Y $) Y da —2m) + 0, (PACED) 
n 0 


[p[<T m= 


whenever ¢ is a C% function with compact support contained in (1, 00). 
In light of the last section, we pull out the possible Siegel zero from our 
explicit formula to later treat it separately. 


ht 


Br T 


(E log(x) log*(qT) ) 


Y x(n)A(n)d(n) =-=- SF blo) D5 b(a—-2m) +. Og 


[p[<T m=0 
(2.23) 
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We now want to pick ¢, calculate its Mellin transform, and apply the bounds 
of the previous three sections to the above. In principle, we would like to use a 
C* function which closely resembles the characteristic function of the interval 
(2, X], so that the left hand side of the above is a finite sum without a cutoff 
function. Because the left hand side is a discrete sum, choosing a cutoff function 
ġ so that the non-constant transitions between 0 and 1 lie between integers will 
be sufficient for obtaining the sum 


Y x(n) A(n) 


n<x 


on the left hand side. We choose the characteristic function of the interval 
[2—1/(2X), X +1/2], and mollify it using Mellin convolution. The convolution 
is defined with respect to the group (R*, -), hence 


ae f sdg) ©. 


Note the analogy with the familiar additive convolution. Just as in the 
additive case, (f * g) has the smoothness of either f or g, whichever is more 
differentiable. Again in analogy with the familiar theory of fourier transforms, 
we have that 


(fx 9)(s) = f(s) - G(s), 
where f is the Mellin transform of f. We now pick a smooth bump function 
na (u) of width 1/X, centered at 1, and with ||n||z1(R+,du/u) = 1. Therefore, 
For u € (0,1 — 1/(2X)) U (1 + 1/(2X), 00), na (u) = 0. We now pick our $ in 
the explicit formula, 


X E du 
ov) = (n * X[2-1/(2X),X+1/2]) (v) =i na (uv *)X[2-1/(2X),X-+1/2](U) a? 
0 


where x; is the characteristic function of the interval J. Note that by the 
above comments ¢ is a CS function. We now check that ¢ has the properties we 
are interested in. Consider the multiplicative translates of 7 4, as they appear 
in the integrand above. We have that 


x =i _ VU Vv 
suppu (ng (ww) = [o - zz .u4 52]. 


Because du/u is an invariant measure for multiplication, we have that any 
of these multiplicative translates has ||n(uv—')||r1(R+,du/u) = 1. In addition, 
given v for which 


supp, (n 


4 
x 


Gare E z sik + 5| 
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we have that the integral reduces to 1. However, combining the last two 
displayed equations shows that the set of such v is exactly [2, X]. Furthermore 
if 


1 1 
SUPP, (72 (uv) E — T J: 5| = 2, 


then the above integral vanishes. The set of such v is (0, $457) U ees ,00). 
Hence, we have that 


Observation 2.5.1. ¢ as defined above is a CS function with 


_ fo fue 0, E) UZ, o0) 
wf, foel X] -o 


(2.24) 


and having smooth transitions elsewhere. 


We now compute the Mellin Transform of ¢, which is conveniently enough 


an 


p(s) = (na * Xp-2,x+2))(5) F na (s) X= 4x4 21(5) 
= M (s) (= 2 e | 
< cn (s) - (= — =) : (2.25) 


To estimate the Mellin transform of our bump function, we must now appeal 
to our estimates of section (2.1). Recall from lemma (2.1.2) that the estimate 
applies for 4 supported in some interval [Y,2Y], with Y > 1. However, our 71 
has very small support, and we may take Y in the statement of the lemma to be, 
say, 2. If we set P(u) = nı (u/3), then @(u) has support contained in [2, 4], and 


we have that |@(s)| = 37|71(s)|. So we may apply our Mellin decay properties 
lemma (2.1.2) with m = 1, 


o-1 


lylo + it)| < Culos, 09) (lle + Y DD ro), 


and Y will be an absolute constant. For our bump function 7 1 we have 
mllz. = O(Y) and ||Dn]|z1 = O(1). Computing for our specific case we have 
90 
3ail 
Furthermore, for the sum over the nontrivial zeros in the explicit formula, 


we have 0 < ø < 1, so the (2)” factor is bounded. Taking p = 8 + iy we have 
for these zeros 


If (s)| < C1 (01,02) 
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173 (o)l = O(n"). 
Drawing these results together, we have for nontrivial zeros p 


ei (= E =) (2.26) 


For the trivial zeros, our Mellin transform estimates yield 
$(—2m + a) = O(X-2™48), 


Using our computations of the Mellin transform at the zeros of L(s, x), we 
may now fully compute (2.23). We have 


Oe a y E 2) Y grama dos (eee): 


=. perl x? os 
(2.27) 
We see immediately that 


contributes only a bounded amount as T — co, and that 


5 gO = O(log(x)). 
m=0 


Hence we absorb these into the error term, and have 


Br p 2 
Y xmAm|<-E- Y a LEO, (ae w + O(log(x)). 
nse lel<T 
(2.28) 
We are finally in a position to apply the estimates of sections (2.2), (2.3) 
and (2.4). Recall that from our conclusions at the end of all three subsections 
of (2.3), we have that all of the non-trivial zeros p = 8 + iy satisfy 


C 
B=1 ar) 


for some constant c. Hence for the numerator of (2.28) we have 


—_¿Jloglz) 
|| = zÊ < ze CRT), 


From section (2.2), we deduced that the number of zeros with imaginary 
part |y| < T was N(T, x)= O(Tlog(qT)). Hence we have that 
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1 r N(T T N(t log(qT 

Ye | ant) = BO 9 O PEA y o 8 4 ogg). 
5] P t T 

Iv[<T 2 o o 


Therefore 


qe —¿ 28) 
— z= O(zxe log(qT) ys (2.29) 
lle 


lol <T 


Taking this with equation (2.28) we obtain 


B1 2 log(« 
Y x(n)A(n)] < = + On aS a) b O(ze reta), (2.30) 


n<x 


We will apply Siegel's theorem to bound the final term above. However, first 
we impose a condition on q and pick T in terms of x to combine the error terms 
in the above. We suppose that 


q < log™ (x) 
for some positive constant M. In particular, this is stronger than 
q < log™ (x) RVR. 
We pick T similarly, setting 
T= ef log(x) 


Hence, our large error term reduces to 


Bı l 
Y mam) < -7 + O4(ae7 9 VOR), (2.31) 
n<x Br 
For the final step, we apply Siegel's theorem subject to our constraint on q. 
We have by the result of the corollary to Siegel's theorem in section (2.4) that 
for any € > 0 the exists c/(e) such that 


Ce 
By <1l- £ 3 
Hence 
dee CORA. 


Recalling our restriction on q from above, we take e = DT: getting q? < 
log(x) and 
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et < qe (My log(z). 
Therefore, we at long last have 


Lemma 2.5.2 (First Main Lemma). If x is a nontrivial character mod q and 
q < log" (x) for some positive M, then we have 


[V(r x)| = Olxe CM V8) (2.32) 


where C(M) is an ineffective constant which depends only on M. 


2.6 Ramanujan Sums 


Two lemmas are necessary to bound the major arcs M in the proof of Vino- 
gradov’s three primes theorem. The first was a bound on y(x, x) which made 
up the bulk of this chapter. We now give some information about Ramanujan 
sums, which is substantially easier than the preceding several sections. 

Recall that we have defined 


a modq 
(a,q)=1 


to be the Ramanujan sum. To prove the lemma, we utilize the Dirichlet convo- 
lution algebra, and write with fixed n 


AO OA R 


dlq dlg a mod q a mod q 
(a,q)=d 


We have that 1* u = id in the convolution algebra, so we convolve y with either 
side to obtain 


cy(n) = So dp (2) ; (2.33) 


From this expression we can see that c¿(n) only depends on q in terms of its 
divisors, and hence is easily seen to be multiplicative. This proves part (a) of 
the original lemma. 


It is now a simple matter to evaluate c¿(n) on prime powers. Let a be the 
largest power of p which divides n. Then we have that 


a [p? —p®* =o) if B<a 
Cpa(n) = Y pulp?) = 4 -p° if8=a+l1 
i=0 0 B>ifa+1 
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mod q 


where the last evaluation follows because u vanishes on proper prime powers. 
This proves part (b) of our lemma. 
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Chapter 3 


Minor Arcs 


In this chapter we investigate sums of the form >” u(nje(na) and find that 
estimates for these depend on the real number a. In particular they depend 
on how near a is to a rational number with small denominator - i.e. whether 
a lies in a major or minor arc. We develop this theory with an eye towards 
proving the third main lemma in chapter 1. We follow Green and Tao [GT] in 
this exposition. To do this, we establish an alternate formula for the y function, 
called Vaughan’s identity. This formula allows us to decompose sums of the 
form J` u(n)f(n) into two parts (type I and type II sums), each with a distinct 
behaviour. Type I sums tend to be large for periodic f and type II sums large 
for multiplicative f. We apply bounds on each of these sums to establish the 
result of the first chapter. 

In this chapter we adopt an “inverse approach” in proving statements, whereas 
in chapter 1 we apply our result in the forward direction. In the case of 
f(n) = e(an) above, we have that if a is close to a rational number with 
large denominator (major arc), then one of the type I or type II sums is large. 
On the other hand, if a is far from such a rational number (minor arc), the both 
the type I and type II sums are small. The “inverse approach” means that we 
will instead show that a type I or type II sum being large implies a lies on a 
major arc. This approach is easier to apply when one deals with more devious 


f. 


3.1 Vaughan's Identity 


Lemma 3.1.1 (Vaughan's Indentity). Let U, V, N be positive integers with 
UV < N. Then forn < N, we have 


an=- Y bot DY, soulo (3.1) 
be|n be|n 


b<U, c<V b>U, c>V 


Proof: 
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Observe that in the Dirichlet convolution algebra we have 1 * w* uw = u 
Writing out the definition of the Dirichlet convolutions this is 


p(n) =X ubao). 
bc|n 


We now split the summation into four ranges and label the corresponding sums 


Die da 


a. Sisk SEA 
II II II II 


Because of the symmetry in b and c, we see that }°, = 73. We also see 
that 5°, = — »,, indeed 


u(n) = X u(b)jule) = X ule) Y (0). 
be|n eln b| 2 


But the interior sum here is 0 unless n = c in which case it equals 1. Hence 
we have 


SY = HOO = Y wle)ben = 0 
1 2 be|n cjn 
e<V c<V 


if V <n. So —} = Ya = » 3, hence 


ln) == +>), 
1 4 


which implies the result. 


We now apply Vaughan's identity to an expression of the form 


OO) 


N<n<2N 


Using Vaughan's identity in the above one has 
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> anim = -E E mo OCHO 
N<n<2N N<n<2N be|n N<n<2N be|n 
b<U, c<V b>U, c>V 


= -Ti+ Tri. 


Where Ty and Tyr are the type I and type I sums we have referred to in the 
introduction to this chapter. If we take the type I sum, and make the change 
of variables d = bc and n = dw, we obtain 


n= Y au Y Ta (3.3) 


1<d<UV  N/d<w<2N/d 


where 


a= Y] uono. 
bc=d 
b<U, c<V 
We can similarly make a change of variables for the type II sum. Taking 
w = b and n = dw we obtain 


Ti = y 5 5 u(w)ba f (dw) (3.4) 


V<d<2N/U max(U,N/d)<w<N/d 


where 


ba = = plc). 


cld 
c>V 


These sums allow us to investigate sums of the form >> p(n) f(n). It is easy 
to see that if this sum is large, then either Ty or Ty is large. The forms (3.3) and 
(3.4) of the type I and II sums, given by Vaughan’s identity allow us to analyze 
their respective sizes. We simplify the expressions and make this precise in the 
following 


Proposition 3.1.2. Let U,V,N be positive integers with UV < N, and let 
f:N=C be a function with ||fl|= = O(1) such that 


Y unfa) > 6 


N<n<2N 
for some ô > 0. Then either 
e Tr is large. There exists an integer 1 < D < UV such that 
1 ô 
F >o f(dw) > 
N/d<w<2N/d 


(3.5) 


43 


for > 6?Dlog~°(N) integers d such that D < d < 2D. 


e Ty, is large. There exist integers D, W with V/2 < D <4N/U and N/4 < 
DW < 4N, such that 


1 A A 64 
DW f (dw) f dw’) f (dw) f(d'w')| > — 3 
DW oe ee . i É x log'*(.N) 

(3.6) 


Proof: 

We take N to be large. We then have that either |Ty| > 6/2 or |Ty| > 6/2. 
We first deal with the case that the type I sum is large. With reference to the 
form (3.3) for Tj, we have the trivial estimate 


lal < Y HOOS 1=r(a). 


be=d bjd 
b<U, c<V 


Which allows us to estimate the type I sum, 


1 
skm X a) 


d |N/d 
1<d<UV N/d<w<2N/d 


f(dw))|. 


Regarding the outer sum as an inner product, we may apply the Cauchy- 
Schwarz inequality and obtain 


<| Y ET S faal Y na 


1<d<UV N/d<w<2N /d 1<d<UV 


To compute the second sum, we observe that 


Tn) _ CAs) 
2, n ¢(28)’ 


which by a quick application of Perron's formula yields 


Hence 


1 1 9? 
2 aN OO) aaa 


1<d<UV N/d<w<2N/d 


Dividing the outer region of summation 1 < d < UV into dyadic blocks 
D < d < 2D allows us to remove the d~! from the above. Applying the 
pigeonhole principle we obtain 


1 62D 
5 Nya 5 f(dw)| > og (N) 


D<d<2D N/d<w<2N/d 


for some D with 1 < D < UV. Recall that at the outset we specified ||f|| z= = 
O(1), so the whole summand of the outer sum is also O(1). Therefore, not too 
many summands can be too small, and this sort of averaging argument shows 
that 


1 ô 
ey NS 
N/d N/d<w<2N/d logP(N) 


for at least > ô? D log *(N) values of d. This proves the proposition for type I 
sums. 

Now we suppose that |7i1| > 6/2. Much like with type I sums we see that 
Iba] < T(d), and from formula (3.4) we have 


Nô 5 T(d) 5 XU,2Ên/ql(w)ulw) f(dw)} . 


V<d<2N/U N/d<w<2N/d 


Again, we think of this as in inner product and apply Cauchy-Schwarz, 


we<[| Y O)| E a| E xwansa twat) tav) 


V<d<2N/U V<d<2N/U |N/d<w<2N/d 


and our estimate for X` r?(d)d7", 


N?8? 


log HN) $ d > X(u,2n/a](w) u(w) f (dw) 


V<d<2N/U  |N/d<w<2N/d 
Once again, we use a dyadic decomposition to remove the d. There exist 
integers D,W with V/2 < D <4N/U and N/4 < DW < 4N such that 
2 
N?6? 
ee Y xs(w)u(w) f (dw) 
5 
Dlog?(N) D<d<2D |W<w<2w 


Where J = (U,2N/d] N (N/d, 2N'/d]. We want to remove the characteristic 
function x of the interval from the above expression. For this we will need a 
technical 
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Lemma 3.1.3 (Completion of Sums). Let I C Z be a discrete interval, and 
f:Z— C be a function. Then we have 


sup] JD f(n)| <log(1 +11) sup | > £(m)e(om)] 


=" net ae nel 


where the supremum on the left ranges over discrete sub-intervals of I. More 
generally, if I’ C Z is another discrete interval, and K :ZxZ — C is a function, 
then we have 


Y 1x5, (m)K (n,m)? < log?(1+[Z]) sup Y 1) Kn, melan)? 


mel” nel o0ER/Z mer nel 
where for each m € T', Jm C Z is an arbitrary discrete interval. 


Proof: 

We apply some discrete Fourier analysis. Assume that / is nonempty and 
translate the interval to J = {1,..., L}. We identify this interval with Z/LZ so 
that we may use Fourier analysis. For any subinterval J of I we can expand 


So fm) = Soxs(n)F(n) 


neJ nel 
= ZVOZ 1%) 
EEZ/LZ nEZ/LZ 


Recall that the finite fourier transform is defined 


vo=-} Y vwe(-%) 


nEZ/LZ 


hence we can crudely approximate it as a geometric series; it is then easy to 
sum and write a good bound for. We have 


1 
v 4min Tue TU)" 
IX (E)] < (i ont) 


We apply this bound with the triangle inequality to the above. 


1 n 
| fa < 4 min (1. aii f(nje ($) 
2. 2 * L|€/L||eyz ps L 
1 
< min (i ) sup f(nje(na) 
on L|€/L||p/z ee 
< log(L+1) sup ` f(n)e(na). 


acR/Z nel 
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This proves the first part of the lemma. The second part of the lemma 
follows by essentially the same argument, but working in /?(Z) instead. We 
have 


Lx (Mk < [203 min (. ae) > Homie) j 
nEZ/LZ 


mel’ nel mel” €€Z/NZ 
2 
1 nE 

[av Keme (8) 
ane ( aeae (DIE EPUA T 

€ E 

< log(L+1) sup (y IX K(n, mje ($) e) f 

oER/Z mel’ nel L 


Hence the second part of the lemma. 


Returning to the original proposition we were proving, we take K(w,d) = 
p(w) f(dw), and applying the second form of the lemma we obtain 


2 
Oe < Y SY plo) f(dw)e(we) 
log" (N) D<d<2D [W<w<2W 


for some a € R/Z. We expand the right hand side as 


u(w, w) f(dw) f(dw’), 

W<w, w"<2W D<d<2D 
where u(-,-) is a bounded function whose exact form we do not worry about. 
The proposition follows from the Cauchy-Schwarz inequality on the Gowers U? 
norm. 


3.2 Orthogonality of u(n) and e(na) 


We now apply the results of the preceding section to the special case where 
f(n) = e(na). Throughout the notation ||a||p/z is used to denote the distance 
of a to the nearest integer. In our results, we wish to say that >” u(nje(na) is 
small if a is far from a rational number with small denominator (minor arc). 
However, in keeping with our “inverse” philosophy, we will prove instead that if 
the sum is large, then a must be in a major arc. These are, of course, logically 
the same thing. We now approach 


Proposition 3.2.1. Let a € R, let A > 0, and let N be a large integer such 
that 


1 


ly Y aln)e(-na)| > log" 4(). (3.7) 
N<n<2N 
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Then there exists D, 1 < D « N?/3, such that 


D 
#{1 < d < 2D: |lod|layz < 5 log*4+14(N)} > Dlog™t^ (N). (3.8) 


Proof: 

Apply the previous proposition (3.1.2). For parameters U and V we pick 
U = V = N?. The proposition says that either the type I or type II sum is 
large. We will show that either of these possibilities yields (3.8). Specifically, 
the proposition gives that one of the following holds, 


e Ti is large. There exists an integer 1 < D < N 3 such that 


1 
e(adw)| > 


e 3.9) 
A+5/2 ( 
N/d<w<2N/d log / (N) 


z| = 


for > Dlog-*4~°(N) integers d such that D < d < 2D. 


e Ty is large. There exist integers D, W with N3 <D<N3 and N/8 < 
DW < 8N, such that 


1 
e(a(dw — dw’ — d'w + d'w'))| > 


7. 4A4147 a7) 
D<d,4<2D W<w,w'"<2W log (N) 
(3.10) 


1 
DW 


We begin with assuming that Ty is large. Recall our bound on a geometric 
series from the above lemma. If I is any interval in Z, then 


[Y e(na)| < 4min ( 1 ) 


Mh Tal 
nel R/Z 
Hence, there are > Dlog 72435 (N) values of d, D < d < 2D, for which 
D 
llod|lnyz < 5 108^"? (N), 
which gives (3.8) with room to spare. 


Now we assume instead that Tyr is large. By the pigeonhole principle we can 
find d', w’ such that 


1 1 
d e La d' f 
— > > e(a(dw — dw w+duw’))| > i 


AFI an * 
D<d<2D W<w<2W 08 (N) 


Now using the triangle inequality and reducing two factors to trivial we have 
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1 1 
OR Y) elaw(d-d'))| > — ¿un 
2 D<d<2D |W<w<2W log (N) 


Applying our geometric sum estimate again gives 


1 `; . ( D ) z 1 
— min | 1, y 
D pD Nlla(d - d')|lr/z log*^ T(N) 


Averaging, we have 


D 
#{D <d < 2D: |la(d—d)|la/z < 5 logt (N)} > Dlog~*4-4(N). 


We now make the change of variables d=d-—d', and obtain 


7 D 
#{-2D < d < 2D: ||adlrz < 5 log (Ns Doa (N). 


Because D > N3, we may remove the case d = 0 and because Fllrsz = 
|| — Bllryz, we have symmetry in the above expression and hence the result. 


We now have a proposition about the size of our sum >> y(n)e(na) in relation 
to where a is. However, the conclusion of the proposition does not make it 
entirely obvious under what conditions our exponential sum is small. We see 
that it involves a major arc / minor arc condition, for if a is close to a rational 
number with denominator d then ||ad||p,/z is small. It turns out that the size of 
the denominator can be made much smaller than this. However, to make this 
precise, we need some tools from harmonic analysis. 


3.2.1 Some Harmonic Analysis Tools 


In this section we develops some technical lemmas along the lines of Montgomery 
[Mo] and Green and Tao [GT]. We begin with some definition and state Vaaler’s 
Lemma, from which one deduces the Erdós-Turán inequality. This inequality 
allow us to prove an important lemma relating recurrent linear function to the 
major arcs of our proof, and in turn proves the more precise version of the above 
proposition. 

We begin by defining the saw-tooth function: 


_ J{z}-4 ife€Z 
y= fh ifzezZ. 


And recall the definition of the Fejér kernel, 


Ag(z)= > (1 = w) e(kx) = EIA (3.11) 


-K 
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Then we have 


Definition 3.2.2 (Vaaler’s Polynomial). Vaaler’s polynomial is 


K 


Vx (a) : 5 ( E 5) Ak+1 (e E ) E D sin(2r(K+)2)-—Ar(2) sin(27x). 


AL 


ry K+1 


K+1 
(3.12) 


V(x) is a good approximation to the the saw-tooth function s(x), we have 


: 1 
V(x) = s(x) +O (min (1 EE) f 


Another possible approximation to s(x) is the 
Definition 3.2.3 (Beurling polynomial). The Buerling polynomial is 


Bx(x) = Vg(x) + g gôr) (3.13) 


The relationship between these functions is given by 
Lemma 3.2.4 (Vaaler’s Lemma). If0 < x < 1/2 then s(x) < Vg(x) < Br(z), 


while if 1/2 < x < 1 then Vx(x) < s(x) < Bx(ax). If T(x) is a trigonometric 
polynomial of degree < K such that T(x) > s(x) for all x then 


1 
i Tate 2(K +1) 


with equality if and only if T(x) = Bx(a). 


Proof: 
The proof of Vaaler’s lemma is a long series of calculations involving ap- 
proximations to the derivatives of the functions involved. For full details see 
Montgomery [Mo]. 


We now make a few more definitions. Given a sequence {un} of points 
U, € R/Z, and 0< a < $< 1, define 


Z(N,a, 8) = 5 1 (3.14) 
n<N 
asu, <B 


Then the sequence {un} is uniformly distributed if 


1 
sim 4 (W, a, 6) =B-a 


for every choice of a, 8. If we let Uy be the measure on R/Z that places a 
point mass at each point of {un}, then we may take the fourier transform of 
this measure, 
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(3.15) 


I 
Mza 
De 
x 
= 


If we define 


D(N, a, B) = Z(N,a,8)—(8—-—a)N 


we can define the discrepancy 


D(N) = ||D(N, a, 6)l| => (a,3)- 


The discrepancy, in essence, measures how close {up} is to being uniformly 
distributed. It is easy to see that if D(N) = o(N) is equivalent to the se- 
quence being uniformly distributed. We can apply Vaaler’s lemma to study the 
discrepancy, however, we need one more 


Definition 3.2.5 (Selberg Polynomials). The Selberg polynomials are given by 


Si(x2) =P=a+ Bx(1— PB) + Bx(a— zr) 


and 


Sl) =P-=0a-—Bx(PB-x)- Br[x—-a). 
Now we are ready to state and prove 


Lemma 3.2.6 (Erdós-Turán inequality). For any positive integer K, 


(3.16) 


Proof: 
We begin with Vaaler’s lemma which gives that —Bx(—-x) < s(x) < Br(a) 
for all x. Because 


xs lc) =p -a + s(x — p) + s(a -— 2), 


we see that the Selberg polynomials are trigonometric polynomials of degree 
at most K which approximate the characteristic function of the arc J = [a, PB] € 
R/Z. By applying Vaaler’s lemma to the definitions of the Selberg polynomials, 
we have that Sz(x) < x7(%) < S£(x) for all z and that 
1 


Sel) dx = 8 — a + ——. 
R/Z g(r) r=p-a Kai 


Now we approach the discrepancy. Fourier inversion and our facts concerning 
the Selberg polynomials give that 


N 
Z(N, a, 8) = XO xg (un) 
"N 
< Y SE(un) 
N 
= Y) Dd, Si()e(kun) 
n=1 k=-K 
K ~ 
= Y BEmÓdN e) 
=_K 


We have symmetry in k, and clearly by the above deductions that Un (0) = N 
and $£(0) =8-—at gy Hence we arrive at 


N 
K+1 


D(N, a, 8) < + Y S£(k)Un(-k). 


O<|k|<K 


We must now estimate the summand. We employ the estimate |f(k)| < 
\|f||z1 to the function S$ (x) — xg(x). Thus we calculate 


1 


ER - RIWI < 15% —xlla = 3: 


It is a introductory exercise to compute 


_ elka) —e(k6) 


Zlk 
Xg (k) T ; 


therefore 


x(k) = 


when k 4 0. Combining this with the above we have 


te) ena (raz) 


SE) < z i ens (o z ) (3.17) 


mr) 


This allows us to evaluate our discrepancy estimate 


N 1 . 1 A 
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A analogous argument produces a lower bound on D(N, a, 8) using St (k). 
Finally, observing that 
1 1 3 


FE’ ae oe. 


we have the estimate on the discrepancy D(N) in the lemma. 


We will apply the Erdós-Turán inequality in the proof of another lemma, 
which will be more useful to us, 


Lemma 3.2.7. Let I C Z be an interval, let a E€ R/Z, and suppose the set 


£={1ET:|lalllayz < 61} 


is of size at least 52|I| for some 0 < 6), 62 <1 with $1 < 302. Then 
1. If |I| > 1/92, then infi<a<s/s, ||od||R/z < a 


2. If |I| > 2/03, then infr<a<16/57 llad|lmyz < 25% 


òg 
Proof: 
Take I to be{ M +1,...,M + L}. Define a sequence (up, as 
u=a(M+l) mod 1. 
The lower bound on the size of £ in the hypotheses gives us a estimate on 
the discrepancy 
1 
D(L, —91,01) > 69 — 26,)L È> 5 oak. 


Comparing this with the estimate from the Erdós-Turán inequality, we have 
that 


L Ae 
< | 
bal < q t32 em) 
k=1 % |I=1 
for any K. We pick K = EAE Hence, there exists k < 5 so that 
93L 
S e(ku) > Pa 
l=1 


Now, we use the estimate for a geometric sum 


|X e(na)| < 4min (in, a 


2 llalley2 


from the first half of this chapter, rearrange, and part 1. of the lemma follows. 
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The estimate 2. is much stronger, and we use a bootstrap method to work 
2 
1. into 2. Assume 6; < È because otherwise 1. already implies the result. Let 


1 < m < L (which will be chosen later), and define 


La ={b4+1,...,b+m}N£. 


By the lower bound for £ given in the hypotheses, we apply the pigeon-hole 
principle to find that there exists some b so that |Ly| > Sam Fix this b. Now 
we consider the set m£ + Ly. By our above estimate we have that 


L 
mL + Lo] > Pet 


and we also have that 
ML +L CI’ ={m(M4+1)+b4+1,...,m(M+L)+b4+m}, 


and |I| < mL. Finally, for any x € ML + Ly one will have ||aa||p/z < 20m. 
We can now apply part 1. of the lemma with I — I’, 6, > 2m901 and 62 — 63/2 
provided we satisfy the conditions of the lemma. If we set m = |03/169, |, we 
have m < 63/166, and mL > 2/63 , so we can apply 1. and the result follows. 


Given this lemma, we now have to tools at our disposal to return to propo- 
sition (3.2.1) regarding the orthogonality of the móbius function and a linear 
phase giving us a major arc condition. We are prepared to make the proposition 
more precise with our most important 


Corollary 3.2.8. Leta ER, let A> 0 and let N be a large integer such that 


l ran 
I= Y wln)e(—na)| > log™^(N) 
N<n<2N 
holds. Then 
log28(4+4) (N) 
me dllajz. <= 3.18 
ie ay Il;z N ( ) 
Proof: 


We apply proposition (3.2.1). Therefore we have that there exists a D with 
1 < D < NÈ such that (3.8) holds. If D is less than log? 444) (N), then we 
have he corollary directly from (3.8). If D > log’“4+* (N), then the argument 
in somewhat more difficult. Observe that if we have a linear sequence al which 
has small circle norm for many / in an interval J, then a must lie on a major arc. 
To see this, we apply the lemma just proved in the case that D > log? 4+4 (N ), 
we have our interval J = [1,2D]. We pick the parameters 6, and d2 to be 
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D 
6 < ÁÚlogH4+9(N) 
N 
1 


log*(4+4) (N) 


dn < 


Then, conclusion (2) of the lemma gives the corollary. 


From the corollary, we take the contrapositive. Extending our sum from 
[N,2N] to [1, N], one picks up at most an extra factor of log( N), and obtains 


Lemma 3.2.9. Leta € R, and C > 0. If for sufficiently large N, and any fixed 
k we have 


log?8(C+4)(N) 
inf d > k 22, 
EE eas [Ida] |rz > N 
then 
1 _ 
ly SY nínje(na)| = O(log” + (N)), 
1<n<N 


which was our third main lemma from the introductory chapter. 
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Chapter 4 


Conclusion 


Drawing together the results on chapters 2 and 3 and their application in chapter 
1, we prove Vinogradov’s theorem on sums of three primes. From the proof 
given, one sees two main ingredients. The first is the prime number theorem for 
arithmetic progressions. That is, the explicit formula given by Mellin transform, 
and the classical zero-free regions. Second is a theory for the treatment of 
exponential sums, which in this essay is given by Vaughan’s identity and many 
analytic estimates, proving that the Mobius function is orthogonal to linear 
phases. This orthogonality, of course, is related to the Mobius randomness 
law, and hence is related to zero-free regions of £-functions and the Riemann 
hypothesis. All together, the proof draws on a wealth of deep and interesting 
mathematics. 

Unfortunately, Vinogradov’s result does not show that all odd integers are a 
sum of three primes, but rather the weaker statement that all sufficiently large 
odd integers are a sum of three primes. To determine exactly how large suffi- 
ciently large is one would have to investigate more carefully the constants in the 


log*(N) 
bound on how far a Siegel zero may be from s = 1) is invoked in the proof of 
Vinogradov’s theorem, and hence determining the constants implicit in the O 
term is quite difficult. 

Some progress has been made in this area. In 1956 Borodzkin showed that 
Vinogradov’s theorem holds for all odd n > 33. In 1989, Chen and Wang 
reduced the number to 1048000, which is still unfortunately beyond the range of 
a computer check. See Ribenboim, [Rib]. As an area for future research, one 
would like to have a more useful way of making constants in Siegel’s theorem 
effective, and get these bound down to reasonable sizes. 


O (un) term. Unfortunately, Siegel’s theorem (which gives an ineffective 
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